SPALS: Fast Alternating Least Squares via Implicit Leverage Scores Sampling

Authors

  • Dehua Cheng
  • Richard Peng
  • Yan Liu
  • Ioakeim Perros
Abstract

Tensor CANDECOMP/PARAFAC (CP) decomposition is a powerful but computationally challenging tool in modern data analytics. In this paper, we show ways of sampling intermediate steps of alternating minimization algorithms for computing low-rank tensor CP decompositions, leading to the sparse alternating least squares (SPALS) method. Specifically, we sample the Khatri-Rao product, which arises as an intermediate object during the iterations of alternating least squares. This product captures the interactions between different tensor modes, and forms the main computational bottleneck for solving many tensor-related tasks. By exploiting the spectral structures of the matrix Khatri-Rao product, we provide efficient access to its statistical leverage scores. When applied to the tensor CP decomposition, our method leads to the first algorithm that runs in sublinear time per iteration and approximates the output of deterministic alternating least squares algorithms. Empirical evaluations of this approach show significant speedups over existing randomized and deterministic routines for performing CP decomposition. On a tensor of size 2.4m × 6.6m × 92k with over 2 billion nonzeros formed by Amazon product reviews, our routine converges in two minutes to the same error as deterministic ALS.
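The sampling idea in the abstract can be illustrated with a minimal sketch of one sampled ALS factor update for a 3-way tensor. This is not the authors' implementation. It assumes that each row of the Khatri-Rao product is sampled with probability proportional to the product of the leverage scores of the corresponding rows of the two factor matrices, which upper-bounds the true leverage score of that row, so the full product never has to be formed. The function names `leverage_scores` and `sampled_als_update`, the mode-1 unfolding convention, and the sample count are illustrative assumptions.

```python
import numpy as np


def leverage_scores(M):
    """Row leverage scores of a tall, full-column-rank matrix M.

    They equal the squared row norms of an orthonormal basis Q of M's
    column space, obtained here from a reduced QR factorization.
    """
    Q, _ = np.linalg.qr(M)
    return np.sum(Q * Q, axis=1)


def sampled_als_update(X1, B, C, n_samples, rng):
    """One sampled least-squares update of factor A (illustrative sketch).

    X1 : (I, J*K) mode-1 unfolding, column index j*K + k (assumed layout).
    B  : (J, R) factor matrix, C : (K, R) factor matrix.
    Row (j, k) of the Khatri-Rao product is B[j] * C[k]; it is drawn with
    probability pb[j] * pc[k], i.e. j and k are sampled independently.
    """
    J, K = B.shape[0], C.shape[0]
    lb, lc = leverage_scores(B), leverage_scores(C)
    pb, pc = lb / lb.sum(), lc / lc.sum()

    # Independent draws from the two factor-level distributions: this is
    # sampling from the product distribution over the J*K rows without
    # ever materializing it.
    j = rng.choice(J, size=n_samples, p=pb)
    k = rng.choice(K, size=n_samples, p=pc)

    # Standard importance-sampling rescaling, 1 / sqrt(m * p_row).
    w = 1.0 / np.sqrt(n_samples * pb[j] * pc[k])

    Zs = (B[j] * C[k]) * w[:, None]        # sampled Khatri-Rao rows
    Xs = X1[:, j * K + k] * w[None, :]     # matching unfolding columns

    # Solve min_A || Xs - A Zs^T ||_F on the sampled subproblem.
    A_T, *_ = np.linalg.lstsq(Zs, Xs.T, rcond=None)
    return A_T.T
```

In this sketch the cost of the update depends on `n_samples` and the rank rather than on J*K, which is the source of the per-iteration savings the abstract describes. On noiseless low-rank data the sampled system is consistent, so the update recovers the exact factor whenever the sampled rows have full column rank.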

Similar resources

A Statistical Perspective on Algorithmic Leveraging

One popular method for dealing with large-scale data sets is sampling. For example, by using the empirical statistical leverage scores as an importance sampling distribution, the method of algorithmic leveraging samples and rescales rows/columns of data matrices to reduce the data size before performing computations on the subproblem. This method has been successful in improving computational e...

The Effect of Coherence on Sampling from Matrices with Orthonormal Columns, and Preconditioned Least Squares Problems

Motivated by the least squares solver Blendenpik, we investigate three strategies for uniform sampling of rows from m×n matrices Q with orthonormal columns. The goal is to determine, with high probability, how many rows are required so that the sampled matrices have full rank and are well-conditioned with respect to inversion. Extensive numerical experiments illustrate that the three sampling st...

Least-Squares Approximate Solution of Overdetermined Sylvester Equations

We address the problem of computing a low-rank estimate Y of the solution X of the Lyapunov equation AX + XA′ + Q = 0 without computing the matrix X itself. This problem has applications in both the reduced-order modeling and the control of large dimensional systems as well as in a hybrid algorithm for the rapid numerical solution of the Lyapunov equation via the alternating direction implicit ...

Lecture 12: Randomized Least-squares Approximation in Practice

Recall that we are interested in the conditioning quality of randomized sketches constructed by RandNLA sampling and projection algorithms. For simplicity of comparison with the Blendenpik paper, I’ll state the results as they are stated in that paper, i.e., with a Hadamard-based projection, and then I’ll point out the generalization (e.g., to leverage score sampling, to other types of projecti...

Lecture 10: Fast Random Projections and FJLT, cont.

  • We will describe a fast algorithm to compute very fine approximations to the leverage scores of an arbitrary tall matrix.
  • We will describe a few subtleties to extend this basic algorithm to non-tall matrices, which is of interest in extending these LS ideas to low-rank matrix approximation.
  • We will describe how to use this algorithm in a fast random sampling algorithm for the LS problem (...

Publication date: 2016